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Abstract Male large odorous frogs (Odorrana graminea) 
have been shown to have a rich vocal repertoire and their 
calls contain harmonics that extend into the ultrasonic 
range, much like calls of male O. tormota, a sympatric 
species. Whereas vocalizations of male O. tormota exhibit 
nonlinear characteristics and contain individual signatures, 
it is unclear whether or not those of O. graminea do. We 
carried out an extensive study of the vocalizations of male 
O. graminea with the goal of investigating whether they: 
(1) contained nonlinear phenomena (NLP), (2) carried 
individual signatures. We recorded and analyzed 1770 
vocal calls from nine males. Their calls were classifiable into 
two distinct categories (one-note calls and multi-note calls). 
Of these, one-note complex/steep-frequency modulation 
call was the most abundant call-type. These calls showed 
diverse frequency modulation patterns, and abundant 
presence of NLP. Various basic spectral and temporal 
parameters of such calls were individually distinct. 
Kruskal-Wallis Analysis of Rank test showed that the calls 
from individual males differed significantly. Subsequent 
discriminant function analysis (DFA) showed that 54.7% 
of the calls could be correctly assigned to individual. These 
results suggested that male O. graminea might be able to 
discriminate neighboring frogs from strangers acoustically. 
However, further research is necessary to demonstrate this 
capacity behaviorally. 
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1. Introduction 


The Large Odorous frog (Odorrana graminea) is an arboreal 
species sympatric with O. tormota inhabiting noisy streams and 
waterfalls in select regions of China (Chen, 1991). Their calls are 
classifiable into six basic call-types and most of them contain 
harmonics that extend into the ultrasonic range (Shen et al, 
2011). The most common call-type is frequency modulation 
(FM) calls with overt frequency modulation throughout the 
duration of the call. 

A notable difference between the vocalizations of O. 
graminea and O. tormota is the occurrence of nonlinear 
phenomena (NLP). NLP refers to four acoustic characteristics 
not found in vocalizations produced by linear sound production 
systems, ie, frequency jump, subharmonic, deterministic chaos, 
or biphonation (Wilden et al., 1998). Frequency jump refers to an 
abrupt or rapid change within a few milli seconds, in the call 
fundamental frequency (f,); subharmonics refers to harmonic 
segments of a call where there is energy at 0.5, 0.33, 0.25, 0.2, or 
0.166 fa; deterministic chaos refers to segments of a call with 
noise-like spectra; biphonation refers to a bifurcation of the f, 
or of the overtones. Whereas 92% of long calls of male (Feng et 
al, 2009b) and 39% of female O. tormota vocalizations (Zhang et 
al, 2017) contain at least one NLP component, there is so far no 
evidence of NLP in the vocalizations of male O. graminea for 
any of the reported call-types (Shen et al., 2011). 

During our longitudinal study of the vocalizations of 
male and female O. tormota in Huangshan (Anhui, China) 
over the last eight years (Zhang et al. 2015, 2016, 2017, 2020), 
we have also collected a small sample of the vocalizations of 
male O. graminea as the two species show partial overlaps in 
their reproductive periods and calling sites. Analysis of these 
vocalizations revealed that some of their calls exhibit frequency 
jumps and/or subharmonics. Furthermore, our pilot field 
observations have also revealed that male O. graminea can 
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maintain the same territorial for a single reproductive season 
(territorial’s radius < 0.3 m) and territorial male commonly have 
three to five territorial neighbors. Meanwhile, several cases of 
male-male physical combat were also observed. Thus, male O. 
graminea appear to display “dear enemy” phenomenon (Fisher, 
1954; Temeles, 1994), a phenomenon frequently observed in 
various groups of animals especially in territorial animals, e.g. 
birds, mammals, fish, amphibians, and insects (Temeles, 1994; 
Briefer et al., 2008; Bee et al., 2016). Such animals discriminate 
familiar neighbors from unknown strangers using a variety of 
sensory cues, and they respond more aggressively to strangers. 
By recognizing neighbors and respecting established territorial 
boundaries, individuals can conserve time and energy that 
would otherwise be expended in unnecessary territorial defense 
(Falls, 1982; Temeles, 1994). It is possible that male O. graminea 
can discriminate familiar neighbors from strangers through 
their vocalizations. 

Results of these casual observations motivated us to carry 
out a thorough study to investigate whether the vocalizations 
of male O. graminea (1) contained NLP and (2) carried individual 
signatures which allowed them to discriminate neighbors from 
strangers acoustically. 


2. Materials and Methods 


2.1. Study site Field study was performed from 26 April 2015 
to 28 May 2015, in the village Fuxi (118°8'44.89”E, 30°5’L61’N; 
datum = WGS84; Elevation: 600 m), in Huangshan, China. The 
ambient temperature and humidity at the study site at evening 
hours ranged from 15°C to 20.5°C and from 72% to 100%, 
respectively. During the study period, male O. graminea emerged 
from their daytime hiding places to call nightly (from 1900 h to 
2400 h) from large boulders in the middle of select sections of 
a local stream (Fu Creek). The calling activity was dependent 
on the ambient temperature and humidity; chorus weakened 
substantially or vanished when the temperature dropped 
below 15°C, or when the humidity was < 70%. Their calling sites 
were unevenly distributed along Fu Creek and concentrated 
in the vicinity of waterfalls or rapid water streams where the 
noise was intense. The ambient noise levels at the frog calling 
sites were measured each night using an SPL meter (TES 1357; 
Tianjin Sheng Xing Tai Technology Co, Tianjin, China) and it 
ranged between 73.5 and 77.8 dB SPL RMS with A weighting. 


2.2. Marking During the first few nights of the study, we 
captured 17 actively calling frogs, and marked their calling sites 
in the field (sites were marked with small, numbered plastic 
tags attached to trees or shrubs). Captured frogs were kept in 
individual holding tanks and placed indoors overnight. On 
the second day of capture, these frogs were tagged individually 
using two different colors of visible implantable elastomer 
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(Northwest Marine Technology, Shaw Island, WA, USA; 
Moosman and Moosman, 2006; Feng et al., 2009b). The marker 
was implanted into the epidermal layer of the skin on their 
dorsal body surface, and/or fore- and hind-limbs. Marking was 
color and location coded for these frogs in order that individual 
frogs could later be identified in the field by shining UV light 
over the frog to visualize the individual-specific locations of 
fluorescent markers. Afterward, the frogs were released into the 
wild, to their individual calling sites. 

Subsequently, each night we searched for marked males 
throughout our study site (L x W: 10 m x 3 m) using a UV light. 
The calling sites of individual males were stable and remained 
essentially unchanged throughout the study period. Thus, most 
evenings many of the labeled frogs could be spotted at the same 
boulders where they were initially captured. Individuals’ calling 
rates fluctuated from one night to the next (0-80 calls), with 
some individuals maintaining a high calling rate across nights, 
eg. active callers might emit up to 80 calls in an evening (~ 1 
hour), whereas others were not so active in the chorus in some 
evenings. Frogs were positively identified before we recorded 
their vocalizations. 


2.3. Sound recording Frogs’ vocalizations were recorded using 
a digital audio recorder (Sound Devices 702, Sound Devices, 
WI, USA; frequency range: 10 Hz—96 kHz) and a miniature 
omni-directional condenser microphone (AKG model C417, 
AKG Acoustics, Vienna, Austria), using a sampling rate of 96 
kHz and 16-bit accuracy. The microphone has a flat frequency 
response over 20-20 000 Hz and a drop of 10 dB at 30 000 Hz. 
The microphone was attached to a bamboo pole and placed 
at a distance of 30 cm from the calling male. Recordings were 
carried out mainly between 19 : 00 h and 22: 30 h. 


2.4. Call analysis In light of the presence of NLP, such as 
frequency jumps, subharmonics, deterministic chaos and 
biphonations, analysis of the vocal signals of O. graminea 
followed that of Feng et al. (2009b). First, for each call we used 
SELENA, a custom-designed software (Narins et al., 2004; Feng 
et al., 2006, 2009b), to display its narrow-band spectrogram 
and determine the call duration. Similar to O. tormota (Feng et 
al, 2009b) and an earlier study in O. graminea (Shen et al, 2011), 
visual inspection of the spectra of all recorded vocalizations 
showed that calls of O. graminea could be categorized into 
distinct call-types on the basis of the call duration, the range of 
the call fundamental frequency (f,), the presence of NLP, and 
the number of notes in a call. 

As one-note complex/steep-FM calls were the predominant 
call-type and contained NLP components, we chose these as 
the focus of further analysis. Other call types, e.g. one-note flat- 
FM and multi-notes, were used for analysis in vocal diversity. 
Using PRAAT (Boersma and Weenink, 2011), we first identified 
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the different temporal segments of a call (Figure 1) on the 
basis of their dynamic regimes. Segmentation was based on 
visual inspection of a call’s narrowband spectrogram (Herzel, 
1993) and associated Fourier frequency spectra, following 
earlier studies (Riede et al., 2000, 2004; Titze et al., 2008; 
Zollinger et al, 2008; Feng et al, 2009b). We noted that calls of 
O. graminea had the following dynamic regimes: harmonic 
phonation, deterministic chaos, frequency jump, biphonation, 
subharmonics, and signal break (Riede et al., 1997; Wilden et 
al, 1998; Fitch et al, 2002; Feng et al., 2009b). To segment a call, 
we simply placed segment borders at the boundaries between 
the different dynamic regimes (Figure 1). Following call 
segmentation, we determined the time and rate of occurrence 
of each phenomenon and measured the durations of the 
various segments (ie, harmonic, subharmonic, deterministic 
chaos, frequency jump, biphonation). We additionally calculated 
these durations as percentages of the total call duration (ie, their 
relative durations). 


Freq. (kHz) 


Figure 1 Oscillogram (top trace) and spectrogram (bottom trace) of a 
one-note steep-FM call showing how the call is segmented temporally 
into deterministic chaos (Ch), harmonic (Ha), subharmonic (Sh), and fre- 
quency jump (Fj) segments. At Fj it shows an abrupt change in the call 
fo; at Sh it shows the presence of energy at 0.33 fo (the first Sh segment) 
and 05 f, (the second Sh segment); at Ch it displays noise-like spectra. 
This call does not show biphonation. The call was analyzed using Selena, 
a sound analysis software (See Methods). 


For each harmonic segment, we tracked the fo using the 
“pitch tracking” mode in PRAAT with 1-ms intervals. No fo 
measurement was made for NLP segments. During fo tracking, 
only the f, values that were perfectly aligned with the call’s 
narrowband spectrogram were taken. If the fọ track deviated 
from the first harmonic, the fo track was recomputed using 
different thresholds and octave cost values in PRAAT, until 
the deviation was completely eliminated. With tracking of fo 
we measured the mean, maximum and minimum fọ as well as 
the difference and ratio between the maximum and minimum 
fo Together, we measured a total of 14 temporal and spectral 
parameters for each call. 


2.5. Statistical analysis We first employed Shapiro—Wilk 
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test to determine whether the data for each sound parameter 
were normally distributed (ie, P > 0.05). It turned out that the 
data for many of the 14 sound parameters analyzed were 
not normally distributed, however, all data had the same 
shape distributions. Therefore, to determine whether or not 
vocalizations of individual males are distinct and contain 
individual vocal signatures, we carried out two different 
analyses. We first performed Kruskal-Wallis Analysis of 
Rank test, to determine whether vocal signals from different 
individuals were significantly different. For this, we included 
the data from 8 frogs from which we recorded > 40 calls each. 
To exclude the potential effect of ambient temperature on 
properties of vocalizations (Gerhardt 1978), data from frog 
#4 with an n of 16 were excluded as its calls were recorded 
from wide ranges of temperature (between 15 and 20.5 °C). 
To quantify call variability, within- and between-individual 
coefficient of variance of all acoustic parameter, ie, CVw and 
CV,, were calculated. Within-individual variability can be 
attributed to many factors, including the ambient temperature 
(Gerhardt 1978) and background noise level. As recordings 
from 8 frogs were performed within narrow ranges of 
ambient temperature (16-19 °C) and noise level (74-77 dB SPL), 
much of the variability represented the differences in frog’s 
neuromuscular control of the vocal apparati at different times. 
Subsequently, we carried out a stepwise discriminant 
function analysis (DFA) to predict group membership for 
each call. The result was a percentage documenting the correct 
assignment to individuals. For this, we removed several 
parameters that were correlated, e.g., absolute and relative 
durations of subharmonics. Out of the 14 sound parameters 
initially analyzed, we were thus left with 8 uncorrelated 
parameters (ie., total duration, duration of the harmonic 
segments, duration of the subharmonic segments, duration of 
frequency jump segments, duration of the biphonic segments, 
average fy, faMax, and f,Max—f,Min). As only 40 one-note 
steep-FM calls had been recorded from frog #16, to keep the 
sample size of the DFA more uniform (Feng et al., 2009b), we 
took the first 40 calls from each of the eight males, thus a total 
of 320 calls. As a requirement of DFA was that the data be 
normally distributed, we first employed Shapiro-Wilk tests to 
examine whether the specific data sets (n = 40) for each sound 
parameter from eight individuals were normally distributed, 
and if not whether the non-normality was attributed to a 
skewness in the distribution or to the presence of outliers. 
We found that about half of the data sets were normally 
distributed; the remainders were not. However, all but < 5% 
of non-normality was due to skewness in the distribution. A 
recent simulation study (Pohar et al., 2004) has shown that 
results of linear discriminant analysis (for data showing normal 
distribution) closely match results of logistic regression (for data 
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not showing normal distribution) if the non-normality is due 
largely to skewness of the distribution. As such, we chose to 
use DFA to determine group membership such that results of 
our analysis could be directly compared to those from earlier 
studies in other anuran species. 

The a priori probability of correct assignment by the DFA 
was based on the total number of groups. Thus, the probability 
of a call belonging to 1 of 8 males was 12.5%. However, this 
procedure did not take into account the characteristics of the 
specific data (Titus et al., 1984). Therefore, we calculated an 
alternative chance level on the basis of a randomized version 
of the actual data set (Solow, 1990). See http://www.random. 
org for a random sequence of the numbers from 1 to 320, using 
“sequence generator” function provided — this sequence was 
linked to the original data set. The number of calls per male was 
unchanged. The randomized set was then subjected to DFA, 
resulting in an average correct assignment (see Feng et al., 2009b 
for details). 

For DFA, we first loaded the data from all uncorrelated 
sound parameters (Riede, 1997; Feng et al. 2009b). Next, to gain 
insight into the parameters which contributed the most to 
individual signatures, we performed analyses comparing the 
relative contributions of frequency-domain versus time-domain 
parameters, and the relative contributions of linear parameters 
versus NLP. 


3. Results 


We recorded a total of 1770 vocal calls from nine male O. 
graminea. Their vocalizations were diverse, consisting of two 
major classes: one-note calls and multi-note calls. One-note calls 
could be further divided into four sub-types on the basis of their 
spectral and temporal characteristics: complex/steep-FM calls 
that display complex and steep FM with 3-5 higher harmonics 
(Figure 2a) which shall be referred to simply as steep-FM calls 
from here on; flat-FM calls with 5-7 higher harmonics (Figure 
2(b)); ultra-short (< 20 ms) low frequency flat-FM calls with 
2-3 harmonics (Figure 2f); low-frequency (< 5 kHz) broadband 
calls (Figure 2g). Multi-note calls could similarly be divided 
into several subtypes on the basis of the spectrotemporal 
characteristics and number of notes (Figures 2c-e, h-j). 

The majority of calls (68.0%) were one-note calls — these 
were emitted at various periods during the nightly chorus. In 
contrast, multi-note calls were heard only after chorus had 
built up later in the evening. The duration of one-note calls 
ranged widely, from 20 to 190 ms with a mean of (58.1 + 352) 
ms. Distribution of call duration was skewed toward the lower 
end of the range (Figure 3a). The remainders of recorded vocal 
signals (32.0%) were multi-note calls. The inter note interval (or 
signal break duration) for these calls had a normal distribution 
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Figure 2 Spectrograms of vocalizations of O. graminea (Fuxi, China, 26 
April-28 May 2015) show that their calls are diverse, comprising four 
subtypes of one-note calls (a: one-note complex/steep-FM call; b: one- 
note flat-FM call; f: ultra-short low-frequency flat-FM calls; g: low-fre- 
quency broadband calls) and six subtypes of multi-note calls (c: two-note 
complex/steep-FM call; d: two-note flat-FM call; e: two-note combina- 
tion call; h: two-note low-frequency broadband call; i: three-note call; j: 
four-note call). 


and ranged from 10 to 290 ms, with a mean of (142.4 + 58.2) 
ms (Figure 3b). Breaks longer than 500 ms were taken as gaps 
(or intervals) between separate calls. Inter-call intervals ranged 
from a few second to tens of second, and to several minutes at 
nights when chorus was sparse. 

One-note steep-FM (37.7%) and one-note flat-FM calls 
(30.0%) showed marked differences in their basic temporal 
and spectral characteristics (Table 1, Figure 4). Compared to 
flat-FM calls, steep-FM calls showed pronounced and diverse 
frequency modulation patterns and had shorter call duration 
(Figure 4f), higher f, (Figure 4e), and higher f,Max (Figure 
4a), {Min (Figure 4b), fp>Max / f,Min (Figure 4d), as well as 
higher f,Max—f,Min (Figure 4c) [Mann-Whitney U tests, P < 
0.0001]. The higher f,Max / fọMin and f,Max—f,Min basically 
reflected differences in the depths of frequency modulation of 
these two call-types. Additionally, steep-FM and flat-FM calls 
differed with respect to the degree to which NLP was contained 
in the call. Most steep-FM calls (Figures 2a, c)) exhibited 
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Table 1 Quantitative description (mean + SD) of fundamental temporal 
and spectral parameters of one-note steep-FM and one-note flat-FM calls 
of male Odorrana graminea (Fuxi, China, 26 April-28 May 2015). Mann- 
Whitney U tests were used to test the differences in the means between the 
two sub-types of one-note calls. 


Sound Parameter Steep-FM Flat-FM P 

Total duration (ms) 48.8 + 19.6 69.9 + 45.4 < 0.0001 
Average fo (kHz) 4.6+0.9 2.9+0.4 < 0.0001 
foMax (kHz) UM ae i, 51611 0!5) < 0.0001 
foMin (kHz) 2.9 205 2.2 + 0.4 < 0.0001 
foMax - foMin (kHz) 4.64 1.2 1.3 + 0.6 < 0.0001 
foMax / fMin 3.0407 1.7404 < 0.0001 


overt NLP. These calls showed one or more frequency jumps 
with other segments of the call featuring subharmonics and 
biphonations (Figure 1). In contrast, flat-FM calls (Figures 2b, 
2d) were pure-tone like, showing only flat FM, and contained 
no NLP. The volume of data for one-note steep-FM calls was 
the largest as males emitted such calls throughout the evening, 
In light of the dominant presence of one-note steep-FM calls we 
selected them as the subject of further studies. 

We recorded a total of 668 one-note steep-FM calls from 
9 males. Calls of individual males were recorded across many 
evenings. The majority of these calls (n = 577; 86.4%) contained 
>1 NLP (Table 2). Only a small number of calls (n = 91; 13.6%) 
consisted of harmonic segments purely, without any NLP. This 
phenomenon was applicable to all frogs, except frog #17 for 
which more than half of his calls did not contain NLP. 

A prevalent NLP component in steep-FM calls was 
frequency jumps. Most steep-FM calls that contained NLP (476 
out of 577; 82.5%) exhibited one or more frequency jumps. At 
the opposite end, deterministic chaos was the least observed 
NLP component. Only two of 577 calls (0.3%) from two males 
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displayed deterministic chaos; for seven males, none of their calls 
contained deterministic chaos. Subharmonics and biphonations 
were intermediate in terms of their prevalence. 

Table 2 shows that there were inter-individual differences 
in the numbers of occurrence of various NLP, especially those 
of subharmonics (0.9%-95%) and biphonation (2.5%-84.6%). 
The differences were also evidenced in terms of the durations 
of various NLP and of harmonic segments (Table 2). For all 
frogs, more than two-thirds (> 67%) of the duration of the call 
comprised harmonic segments, with the remaining consisted 
of NLP segments. The relative durations of subharmonics 
and biphonation showed particularly large inter-individual 
differences. In contrast, the relative duration of frequency jump 
was not as variable among individual males, and the relative 
duration for deterministic chaos was very low for all frogs. 

Other than individual differences in the numbers of 
occurrence of NLP, steep-FM calls also displayed intra- and 
inter-individual differences in the frequency modulation 
patterns (Figure 5). A common feature of steep-FM calls was 
their general downward FM trend, but the FM pattern and 
the range of frequency excursion (or depth of FM), were 
highly variable. Calls of frog #17 featured first upward and 
later downward FM, and they showed no NLP other than 
frequency jumps (Figure 5d). Calls of other males tended to 
show a constant frequency in the beginning of the call followed 
by downward FM after the midpoint of the call (Figures 5b, 
d). Most calls displayed a downward frequency jump in the 
middle of the call, and the time of its occurrence was individual 
specific. Calls of some individuals showed additional downward 
or upward frequency jumps at other time points (Figures 5a, 
d). Whereas calls of some males showed a short biphonation 
segment at the beginning of the call (Figure. 5{a)), calls of other 
males contained a biphonation segment near the end of the call 
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Figure 3 (a) Distribution of the duration of one-note calls, including one-note steep-FM calls, one-note flat-FM calls, ultra-short low-fre- 
quency broadband calls, and low-frequency broadband call. (b) Distribution of the inter-note interval (i.e., signal break) of multi-note calls. 
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(Figure 5b). 

Quantitative measures of spectral parameters for individual 
males are shown in Table 3. The average f) of one-note steep- 
FM calls was 4.6 + 09 kHz (range: 3.9-5.3 kHz). The fo, along 
with other spectral parameters, appeared to be individual 
specific as there existed large inter-individual differences. 

To investigate whether calls of O. graminea contained 
individual signatures, we carried out Kruskal-Wallis Analysis 
of Rank test of 320 calls. The K-W test revealed significant 
differences between calls from individual frogs (Table 4). Results 
of K-W test further showed that all temporal and spectral 
parameters were significantly different between individuals. 

To determine statistically whether the inter-individual 
variability was greater than intra-individual variability as 
observed qualitatively, we measured the between-(CV,) and 
within-individual coefficients of variations (CV,,). The result 
showed that, for all acoustic parameters, the ratio CV,/CV,, 
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was above 1, indicating that the CV, was greater than the CV, 
(Table 5). 

To determine group membership for calls we performed a 
stepwise forward discriminant function analysis (DFA). Results 
of DFA showed that the average correct assignment of the 
original data set was 54.7%, indicating that on average 54.7% 
of the calls could be correctly assigned to the eight individuals 
(Table 6). This was higher than the chance-level correct 
assignment of 11.3%-13.1%. One-sided paired t-test showed 
that these two probabilities of correct assignment differed 
significantly (P < 0.05). 

To gain insight into the parameters which contributed 
the most to individual signatures, we compare the relative 
contributions of frequency-domain versus time-domain 
parameters, we loaded the two sets of parameters separately 
into the DFA.We found that the mean correct assignment on 
the basis of temporal parameters alone was higher (49.4%) than 


Table 2 Frequency of occurrence of the various NLP (shown on the left-half of the table) and the call duration and the relative durations (mean + SD) of 
the harmonic and various NLP segments (shown on the right-half of the table) in one-note steep-FM calls of Odorrana graminea (Fuxi, China, 26 April-28 
May 2015). Abbreviations: SVL: snout-vent length; n: number of calls; Duration: total duration of calls; Ha: hamornic segment; Fj: frequency jump segment; 


Sh: subhamornic segment; Bp: biphonation segment; Ch: chaotic segment. 


#2 5.62 65 1 wa SS) Ie Bi GS Il 15 42.248°5 66.84 16:7 105+ 13.9 34222 19.0 411:4 0.34 2.2 
#4 527 16 1 63 13 813 3 188 6 375 0O 0 42.349.7 83.2+16.8 494109 4.2+2.8 7.74110 0+0 
#7 544 95 1 Il) sk lI) O sh If) SEM as sty S r EA OES yy SES OES OOE OS 
#9 549 93 10 10.8 66 71 14 151 44 473 0 0 43.7413.481.7415.7 25460 4.3+3.6 1164142 00 
EAUSO On O T E A E O AS sO OSS SSS SS E 2 OSE ONES 00 
#13 5.17 96 1 1 77 80.2 17 177 48 50 0 0 52.44£13.285.14115 24453 3.5428 90411.1 0+0 
WS 202 10l ¢ 68 YO Bil 2 2 a3 ae 0 4444144859+10.7 0443.1 6943.5 6749.6 0+0 
#16 5.11 40 1 25 38 95 38 95 1 25 0 O 43.9494 95.7435 0732 3.3415 0342.0 0+0 
Ply’ eyisy Wey BE) ih) BIS) I Oo ile lei @ 0+0 
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Table 3 This table shows a quantitative description (mean + SD) of the spectral characteristics of one-note steep-FM calls of male Odorrana graminea (Fuxi, 
China, 26 April-28 May 2015). n: the number of calls recorded and analyzed; fy call fundamental frequency. 


Frog ID n Mean f (kHz) foMax (kHz) faMin (kHz) foMax - f¿Min (kHz) foMax / fMin 
#2 65 4.8+1.0 TSN 2.8+0.8 4721 2.8 + 0.6 
#4 16 5310 7.2 +0.7 3.1+0.2 4.141.3 2.5+0.5 
#7 95 SEENI JE EO See aie 5.14+0.8 OS 
#9 93 4.3 + 0.7 7.3 £0.2 2.4+0.5 5.0 + 1.4 3.3409 
#11 56 4921 Se all 2.440.3 S219 TT 
#13 96 5.0+ 1.1 7.9: 1:0 2.4 40.4 5:5410 3.4 +40.7 
#15 101 4.0 + 0.5 SENS Sse te 3.2205 2303 
#16 40 4.5+0.7 7A+ 0.6 2.1+:0,3 5.40.6 3.7 £0.5 
#17 106 4.4 + 0.6 6.40.9 4.0 + 0.9 2.8 + 0.6 


2.3 + 0.4 
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that based on spectral parameters alone (36.9%). Both were 
above chance level. We also calculated and compared the mean 
correct assignments for linear versus nonlinear parameters. 
We found that the nonlinear parameter set yielded a higher 
mean correct assignment (44.1%) compared to that for linear 
parameter set (31.6%). 

Figure 6 shows a plot of male calls in a three-dimensional 
signal space defined by the canonical scores. The figure shows 
that there was overlap among individuals, but the separation in 


space was sufficiently large to allow above-chance assignment 
of the DFA. 
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4. Discussion 


Results of the present study show that calls of O. graminea 
are comprise two major classes (Figure 2). Similar to the vocal 
signals of male (Feng et al, 2009b) and female O. tomota (Zhang 
et al. 2017), spectrotemporal features of their calls are highly 
variable intra- and inter-individual variability, such as one-note 
steep-FM calls, three-note and four-note calls as well as a couple 
two-note calls. In addition, a notable feature of the vocalization 
of male O. graminea is that, compared to the advertisement 
calls of O. tormota (Feng et al., 2009b; Zhang et al., 2017) and 


2.5 3 3.5 4 


J Min (kHz) 


1 2 3 4 5 6 7 


J Max! fMin (kHz) 


Duration (ms) 


Figure 4 Distributions of various basic spectral and temporal parameters of one-note steep-FM calls (in black) and one-note flat-FM calls (in 
grey). (a) distribution of f,;Max; (b) distribution of fMin; (c) distribution of f;Max—f,Min; (d) distribution of f;Max/f,Min; (e) distribution of 


average f0; (f) distribution of the call duration. 
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Table 4 Results of Kruskal-Wallis Analysis of Rank test examining Table 5 Coefficients of variations of call duration and durations of var- 
between-individual variability (n = 320; df = 7). ious harmonic and NLP segments (absolute measure only) of one-note 
steep-FM calls of Odorrana graminea. Abbreviations: Ha: hamornic segment; 
Sound Parameters x2 P Fj: frequency jump segment; Sh: subhamornic segment; Bp: biphonation 
= segment. 
Total duration (ms) 48.637 < 0.0001 
Harmonic segment duration (ms) 85.038 < 0.0001 Parameter Mean CV, Mean CV, CV,/CV,, 
“Harmonic segment duration (%) 134.329 < 0.0001 Total duration (ms) 34.38 + 14.57 (21.28-64.26) 41.29 120 
“Subharmonic segment duration (ms) 46.335 < 0.0001 Duration of Ha (ms) 40.07 + 17.36 (20.84-75.72) 48.25 1.20 
Subharmonic segment duration (%) 47.373 < 0.0001 Duration of Sh (ms) 43.47 + 15.73 (26.12-60.89) 55.34 1.27 
Frequency jump segment duration (ms) 90.624 < 0.0001 Duration of Fj (ms) 43.77 + 13.33 (20.41-64.11) 54.11 1.24 
“Frequency jump segment duration (%) 97.761 < 0.0001 Duration of Bp (ms) 34.29 + 14.28 (14.45-55.46) 47.04 1.37 
‘Biphonic segments duration (ms) 90.624 < 0.0001 Average f, (Hz) 18.49 + 3.10 (14.41-22.28) 20.66 1.12 
Biphonic segments duration (%) 95.577 < 0.0001 foMax (Hz) 13.55 + 3.38 (8.74-19.37) 16.83 1.24 
“Average f, (Hz) 44.761 <0,0001 fMin (Hz) 15.83 + 3.67 (10.28-20.81) 17.91 1.13 
“fyMax (Hz) 100.327 < 0.0001 fpMax-f,Min (Hz) 20.21 + 6.62 (10.58-30.42) 26.29 1.30 
Min (Hz 69.665 < 0.0001 ; ; i 
fMin Ha) other anuran species studied previously (Bee et al, 1999; Bee and 
*fMax — fəMin (Hz) 114.482 < 0.0001 i 
Gerhardt, 2001; Gasser et al., 2009; Bee et al., 2016), the duration 
fMax / fMin 118.23 < 0.0001 


of one-note steep-FM calls consisted of NLP components is very 


Note: ‘indicates the sound parameters used in the discriminant function short (average of 488 ms). As a result, it limits the numbers of 
analysis. 


occurrence of the various NLP, as well as the variety of NLP 
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Figure 5 Distributions of various basic spectral and temporal parameters of one-note steep-FM calls (in black) and one-note flat-FM calls (in 
grey). (a) distribution of f,Max; (b) distribution of f,Min; (c) distribution of f;Max—f,Min; (d) distribution of f,;Max/f,Min; (e) distribution of 
average f,; (f) distribution of the call duration. 
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Figure 6 Vocalizations of male O. graminea (Fuxi, China, 26 April-28 

May 2015) in a three-dimensional signal space defined by canonical 

scores. 
that can show up in the vocal signal. In fact, the numbers of 
occurrence of chaos and subharmonics are very low when 
compared to male O. tormota (Feng et al, 2009b), another anuran 
species that exhibit pronounced NLP in their advertisement 
calls. 

The proportion of calls containing at least one NLP in male 

O. graminea (86.4%) is comparable to that for male O. tormota, for 
whom 93% of vocal signals exhibit NLP (Feng et al, 2009b). For 
anurans, nonlinearities originated either from the vocal folds 
themselves, or from an interactive coupling between the folds 
and the surrounding airspaces, and the allometric relationships 


between body size and vocal apparatus could affect the 
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occurrence and strength of nonlinearities in acoustic signals 
(Cazau et al, 2016). For example, in Darwin's frogs (Rhinoderma 
darwinii), the smaller individuals had higher proportions of 
the relative duration of chaos, suggesting the body size has an 
influence on NLP components (Serrano et al., 2019). The factors 
contributing to the occurrence of NLP in male O. graminea 
remain to be determined empirically. However, the anatomical 
variation of vocal folds between O. graminea and O. tormota 
might contribute to the differences on proportions of NLP 
among them. 

Our study showed that vocalization of male O. graminea 
contained NLP and exhibited individual signatures. The first 
result differs from that of Shen et al. (2011). Their study did 
not uncover NLP in any of the calls recorded from male O. 
graminea. A contributing factor to this discrepancy is the fewer 
number of call-types found in their study. We have observed 
that only select call-types, in particular one-note steep-FM calls, 
contain NLP, and many other call-types do not. The variability 
across individuals is another contributing factor, as calls of some 
male exhibit few or no NLP whereas other males emit calls 
many of which exhibiting prominent NLP. 

It has been suggested that the presence of NLP can 
contribute to individual signatures (Wilden et al., 1998; Fitch 
et al, 2002; Volodina et al., 2006). Results of the present study 
in male O. graminea, as well as results from male (Feng et al., 
2009b) and female O. tormota (Zhang et al., 2017), provide 
support for this principle, as the vocalizations of these frogs all 
carry individual signatures, and for these species the relative 
abundance of NLP components is individual specific. Further, 
results of comparative DFA in the present study directly show 
that NLP contributes more to individual signatures than 


linear sound parameters such as the average and maximum fo 


fMax—f,Min,and the call duration. 


Table 6 This table shows the results of discriminant function analysis. The results were based on correct classification of 320 calls from eight frogs based on 
eight sound parameters. Shown is the correct classification for the original data set as well as of the randomized data set (#1-#5). The one-sided t-test com- 
pares the correct classifications of the original data set with the respective random data sets. Significance level P < 0.05. 


Frog ID i Original Random Random Random Random Random 
date set date set 1 date set 2 date set 3 date set 4 date set 5 
2# 40 65 20 UES 10 10 15 
7# 40 47.5 10 10 5 7.5 15 
9# 40 225 15 15 10 12.5 15 
11# 40 47.5 15 5 10 12.5 TS 
13# 40 B25 20 17.5 12.5 175 12.5 
15# 40 82.5 TS 15 0 15 17.5 
16# 40 62.5 5 75 12.5 10 5 
17# 40 TTD 10 12.5 0 17.5 17.5 
Mean correct assignment (%) 54.7 12.8 eS) iis 12.8 13.1 
One-sided paired t-test t = 4.86 t=5.49 t=5.69 t=5.67 t=5.75 
P=0.001 P=0.001 P=0,001 P= 0,001 


P=0.001 


[No.1 


DFA of calls of male O. graminea have shown that their calls 
are statistically discriminable, as has been shown for various 
anuran species (Bee et al., 2016). Stepwise DFA can correctly 
classify ~55% of the calls to eight individuals. This classification 
level is comparable to that observed in male concave-eared 
torrent frogs (55% for 6 frogs; Feng et al., 2009a), male green 
frogs (52%-100%; Bee et al., 1999) and male aromobatid frog 
(65%-94%; Gasser et al, 2009), but is lower than that of female 
concave-eared torrent frogs (87% for 5 frogs; Zhang et al, 2017) 
and male bullfrogs (92% for 5 males; Bee and Gerhardt, 2001). 
The classification success of O. graminea falls in the lower range 
comparatively. This is not surprising due to the larger number 
of individuals used in the DFA compared to some previous 
studies. Comparison of DFA data in various anuran species 
reveals that there is a trend of decreasing DFA value with an 
increase in the number of individuals sampled. This trend is 
compatible with the conclusion reached by a systematic study 
in little owls in which Linhart and Šálek (2017) demonstrated 
empirically that DFA value is smaller when the number of 
individuals is increased. In addition, because NLP contributes 
more to individual signatures than that of linear sound 
parameters, the shorter duration of one-note steep-FM calls 
containing NLP components can be expected to reduce the 
individual distinctiveness for O. graminea. Lastly, discrimination 
only based on one call type and its properties would probably 
be underestimated because other call types and their signal 
properties might have some potential to allow discrimination 
among male O. graminea (Bee and Gerhardt, 2001). Future 
studies should examine the importance of other acoustic 
features as potential recognition cues. 

For male O. tormota, males seem to have ability to 
discriminate nearby neighbors from strangers acoustically 
(Feng et al., 2009a) due in part to the presence of individual 
signatures in their calls (Feng et al., 2009b). Whether or not 
male O. graminea possess this discrimination ability remains 
to be determined. Our field observations reveal that males O. 
graminea occupy the same calling sites night-in and night-out, 
and their calling sites remain unchanged throughout a single 
reproductive season. They tolerate the presence of neighbor 
vocalizing males. At the peak of chorusing activity, a male 
may be surrounded by 3-5 actively calling males within a 30 
cm radius in 3D. Furthermore, we have also observed several 
cases of male-male physical combat. Thus, they appear to 
display “dear enemy phenomenon” (Fisher, 1954). Validation of 
this hypothesis, however, requires experimental evidence, for 
instance through playing back multiple calls from “stranger” 
versus calls from a familiar neighbor in the field from the 
neighbor's original calling site (Bee et al., 2016). 

We found that whereas one-note steep-FM calls possess 
harmonics in the ultrasonic range and many exhibit NLP, 
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one-note flat-FM calls uniformly do not have energy in the 
ultrasonic range nor contain NLP (Figure 2). This finding raises 
an interesting question. Namely, whether the extension of call 
energy into the ultrasonic range relate to the occurrence of NLP, 
as the appearance of NLP in anurans has so far been observed 
mostly to species which produce ultrasound. In other words, 
are elements of the vocal fold responsible for the production 
of ultrasound shared with those elements responsible for the 
production of NLP, or are these two groups independent? A 
survey of comparative literature in anurans indicates that 
this relationship appears tenuous. Whereas most vocalizations 
of male (Narins et al, 2004; Feng et al, 2009b) and female O. 
tormota (Zhang et al, 2017) as well as those of male O. graminea 
(this study) have harmonics in the ultrasonic range and they 
also exhibit prominent NLP, those of Huia cavit ympanum (Arch 
et al, 2008) show no NLP despite having energy well into the 
ultrasonic range. As large inter- and intra-individual variability 
can obscure the observation of NLP, as evidenced for female 
O. tormota (Shen et al., 2008) and male O. graminea (Shen et al., 
2011), whether the vocalizations of Huia cavitympanum are truly 
devoid of NLP should be reexamined before reaching a safe 
conclusion regarding the relationship between ultrasonic calls 
and occurrence of NLP. In general, further research is needed 
to illuminate the function, the behavioral significance, and the 
mechanisms of production of NLP in various anuran species. 

Beyond contributing to individual signatures, the function 
of NLP remains poorly understood. It has been suggested that 
NLP boost the auditory salience and provide cues to the fitness 
of vocalizers (Wilden et al., 1998; Riede et al., 2000). Exactly 
what are the primary and secondary function of NLP is well 
deserved of a further investigation. 
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